Compression Aware Physical Database Design
نویسندگان
چکیده
Modern RDBMSs support the ability to compress data using methods such as null suppression and dictionary encoding. Data compression offers the promise of significantly reducing storage requirements and improving I/O performance for decision support queries. However, compression can also slow down update and query performance due to the CPU costs of compression and decompression. In this paper, we study how data compression affects choice of appropriate physical database design, such as indexes, for a given workload. We observe that approaches that decouple the decision of whether or not to choose an index from whether or not to compress the index can result in poor solutions. Thus, we focus on the novel problem of integrating compression into physical database design in a scalable manner. We have implemented our techniques by modifying Microsoft SQL Server and the Database Engine Tuning Advisor (DTA) physical design tool. Our techniques are general and are potentially applicable to DBMSs that support other compression methods. Our experimental results on real world as well as TPC-H benchmark workloads demonstrate the effectiveness of our techniques.
منابع مشابه
Compression-Aware In-Memory Query Processing: Vision, System Design and Beyond
In-memory database systems have to keep base data as well as intermediate results generated during query processing in main memory. In addition, the e↵ort to access intermediate results is equivalent to the e↵ort to access the base data. Therefore, the optimization of intermediate results is interesting and has a high impact on the performance of the query execution. For this domain, we propose...
متن کاملSemantics-Aware Advanced OLAP Visualization of Multidimensional Data Cubes
Efficiently supporting advanced OLAP visualization of multidimensional data cubes is a novel and challenging research topic, which results to be of interest for a large family of data warehouse applications relying on the management of spatio-temporal (e.g., mobile) data, scientific and statistical data, sensor network data, biological data, etc. On the other hand, the issue of visualizing mult...
متن کامل“ Efficient on - line operating system controlled data compression for embedded systems ” and “ Thermal - Aware Modeling , Analysis , and Physical Design for Reliable
, pm University of Karlsruhe, am Fasanengarten, Building 50.34, Room -10 . Friday 10.03.2006 14:00 1 Abstract Memory is a scarce resource during embedded system design. Increasing memory often increases packaging costs, cooling costs, size, and power consumption. In this talk, we describe CRAMES, an efficient software-based RAM compression technique for embedded systems. The goal of CRAMES is t...
متن کاملAn NVM Aware MariaDB Database System and Associated IO Workload on File Systems
MariaDB is a community-developed fork of the MySQL relational database management system and originally designed and implemented in order to use the traditional spinning disk architecture. With Non-Volatile memory (NVM) technology now in the forefront and main stream for server storage (Data centers), MariaDB addresses the need by adding support for NVM devices and introduces NVM Compression me...
متن کاملCompressed Dictionaries: Space Measures, Data Sets, and Experiments
In this paper, we present an experimental study of the spacetime tradeoffs for the dictionary problem, where we design a data structure to represent set data, which consist of a subset S of n items out of a universe U = {0, 1, . . . , u − 1} supporting various queries on S. Our primary goal is to reduce the space required for such a dictionary data structure. Many compression schemes have been ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 4 شماره
صفحات -
تاریخ انتشار 2011